Benford’s Law, Families of Distributions and a Test Basis
Abstract
Benford's Law is used to test for data irregularities. Although the approach is novel, the current methodology has two weaknesses. First, the test values used in practice are too conservative; the test values derived in this paper are more powerful and hold for fairly small samples. Second, testing requires Benford's Law to hold, which it often does not. I present a simple method that transforms distributions to satisfy the Law with arbitrary precision and induces scale invariance, freeing tests from the choice of units. I also derive a rate of convergence to Benford's Law. Finally, I apply the results to common distributions.
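As a rough illustration of the kind of first-digit test the abstract refers to, the sketch below computes the Benford first-digit probabilities, log10(1 + 1/d), and a standard Pearson chi-square statistic against them. This is not the paper's own test or its test values, which the abstract does not state; the function names (benford_probs, first_digit, chi_square_stat) are hypothetical and chosen only for this example.

```python
import math
from collections import Counter


def benford_probs():
    """Benford's Law: P(first digit = d) = log10(1 + 1/d) for d = 1..9."""
    return {d: math.log10(1 + 1 / d) for d in range(1, 10)}


def first_digit(x):
    """Return the first significant digit of a nonzero number."""
    if x == 0:
        raise ValueError("zero has no first significant digit")
    # Scientific notation puts the leading digit first, e.g. '1.23e-03'.
    return int(f"{abs(float(x)):.15e}"[0])


def chi_square_stat(data):
    """Pearson chi-square statistic of observed first digits vs. Benford.

    Assumption for this sketch: a plain chi-square with 8 degrees of
    freedom, not the test values proposed in the paper.
    """
    values = [x for x in data if x != 0]
    n = len(values)
    counts = Counter(first_digit(x) for x in values)
    probs = benford_probs()
    return sum(
        (counts.get(d, 0) - n * p) ** 2 / (n * p) for d, p in probs.items()
    )


if __name__ == "__main__":
    # Powers of 2 are a classic sequence that follows Benford's Law closely.
    sample = [2 ** k for k in range(100)]
    print(chi_square_stat(sample))
```

The statistic would typically be compared with a chi-square critical value for 8 degrees of freedom (about 15.51 at the 5% level). Because first digits depend on the units of measurement unless the data are scale invariant, rescaling the sample can change the result, which is the issue the abstract's transformation is meant to address.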
Similar resources
Survival Distributions Satisfying Benford’s Law
Hill stated that “An interesting open problem is to determine which common distributions (or mixtures thereof) satisfy Benford’s law . . .”. This article quantifies compliance with Benford’s law for several popular survival distributions. The traditional analysis of Benford’s law considers its applicability to datasets. This article switches the emphasis to probability distributions that obey B...
Stigler’s approach to recovering the distribution of first significant digits in natural data sets
Benford’s Law can be seen as one of the many first significant digit (FSD) distributions in a family of monotonically decreasing distributions. We examine the interrelationship between Benford and other monotonically decreasing distributions such as those arising from Stigler, Zipf, and the power laws. We examine the theoretical basis of the Stigler distribution and extend his reasoning by inco...
Application of Benford’s Law in Analyzing Geotechnical Data
Benford’s law predicts the frequency of the first digit of numbers met in a wide range of naturally occurring phenomena. In data sets that follow Benford’s law, numbers start with a small leading digit more often than with a large one. The law can be used as a tool for detecting fraud and abnormality in number sets, including fabricated ones. This can be used as an ...
Benford’s Law: An Empirical Investigation and a Novel Explanation
This report describes an investigation into Benford’s Law for the distribution of leading digits in real data sets. A large number of such data sets were examined, and only a small fraction of them were found to conform to the law. Three classes of mathematical models of processes that might account for such a leading-digit distribution were also investigated. We found that based on...
Evaluation of Large-scale Data to Detect Irregularity in Payment for Medical Services
Background: Sophisticated anti-fraud systems for the healthcare sector have been built based on several statistical methods. Although existing methods have been developed to detect fraud in the healthcare sector, these algorithms consume considerable time and cost, and lack a theoretical basis to handle large-scale data. Objectives: Based on mathematical theory, this study proposes a new approa...
Publication date: 2010